SEQ ID NO: 35 



A C G -A---. » _. A A. G 
.--Av_. , •„ .-\^**_* A . 



GACTGTAATC 
AGAACATAAC 
ACTGTGTCTA 
.AATGACATTG 
AGCATGCC-CA 
GCCAA.CCCAG 
ATATC-TTATC 
C-C-TTTGTCGT 
ATGTTCGGGA 
TAGAAGCACT 
CGTC-TCTTGC- 
AT.AA.GCAGTA 
ATTTCGACAA 
CAGTAGAACA 
CTCATGGTTG 
AACAATAGAA 
GTCTTGCTTC 
GCAGCTCTAA 
ACTGATCGAG 
TTTTGA.GA.GA 
CTCTGGAGTT 
ACAGTATGTA 
GTCAAGCAGT 
GATGAACTGG 
GAA.GAACATC 
CA.GCCATAGA 
GGAGATCAGG 
ATGGGCAGAC 
TTGACAGCAT 
AGACTCAA.CG 
AAACAACACA 
GCAACTAGTC 
ACTTAGGATT 
GAATCCAACC 
A-AACAATCAA 
CTGACATCTC 
TCCCAAGAGA 
ACTTAGCACG 
AGG.AA_AA.TAG 
C-AA.CGTGCCA 
GGGAGGATAT 
TCACTCGAAG 
A.TCCAGGAAG 
TACTAAGA.GG 
GAAGTGATGC 



•-•\_-AAGAGAC . 
T AAAG AAC T T 
ATGTTC-AGTC 
G.AA_ATCAGCT 
CATTTGCTCT 
GCT 

AGG T GG A 
AA-TTATATTT 
TACATGATAG 
C.AA.GACTAGA 
GTGATCTTGA 
TCTACAA.TCG 
AGCCCTTATA. 
TATCAGGATT 
GATGGAA.CA.G 
.AA.TTGGATCA 
AAA.CACTGAT 
AAGAATATA.C 
ATTTTTCAAC 
CTCTGTCTA.C 
TTATATCTAT 
TCCCGTGCAT 
ATGCGATGGG 
ACAGGAAGGT 
GGCACGTGA.T 
GGGTCACACA 
AGCAGTTCA.G 
AATGGCGATA 
ATCAAGGAGA 
GAAACCGGGA 
CAAAACTGAA. 
AGAAAAGGAA 
AATCAAACTG 
ACAAAGAGAT 
AATGGAAATT 
ACAACTCAAT 
ATCATGGATT 
ATCC-GCCCTC 
AC ACGGC AG A 
ACAATCTACC 
TGGACCAGCT 
CAGAGACAAA 
AGGAGAGGAA 
AATCTCCAGA 
ATATTGATTA 
GAAATGCGAC 
CATTCCTCCA 



i A.C_,GAA_ 

tatt::-Av 



----- A ~ _ „ _ 
AA.CATCAAA7 
A.G A_AA.C- ACT C 
GAGATGGTTT 
GTA.TGATCAA 
AGGATCTTGT 
ATCCAAC-TTT 
GA.GGA_AA.GGA 
TTA_A.ATCC.AC- 
ATTATGAGGT 
AACAATGAAC 
AGATTGTAGG 
ACAATCAGAT 
CCTTAGACCG 
CAAA.GGGGCC 
GGTGAGTTTG 
TGTAGCAGTT 
CTTATCTGGA 
GCCGAGTCGC 
AGAAGCCAA.G 
A.TACAACCTT 
GA.TGAAGAAG 
TGAGCCTCGC- 
ATGACAA.TCA 
CAAAGAAACA 
ACAGAGTGAC 
AAATA.GATGA 
GACCACTATC 
ATCCAATCCA 
CAA.CCA_AA.GA. 
CTTGGGAAGA 
GACATCATTC- 
CAGCAATGAA 
AA.CCTG.AA.TC 
AA.CA_AA.AA.TC 
A.G A.T AG A_AA.T 
GCAGCCCAGA 
AGCAGCCCAG 
CAATGAAGTT 
A-ATTTAAAGA 
AC A_AA_AC AAG 



. .-A.<~;v_'V--'- 

ATT TAG - — • 
TT A TTCCCC-G 
A T AAC AG ATC- 

CTCTGTTAT7 

C-GTAC-TAATG 

AGG AA.G AC AG 

ATG.-_AA_-.GAC 

C-ACAATATGT 

TCA7ACTTTT 

GGA.T AA.T AC T 

TTCTTTACTC 

TCTAGTGTTG 

CCCAACAGAG 

ACAGGCAGGA. 

.AA-ACTACATC 

ATGGCATTGA 

GATATCAACA 

ACGTGCTCCT 

CACCAGGCAA 

GTACAAAACA 

TA.TTGAAATG 

AGATGAGTTC 

CAAAGCTTGA 

TCATAAGCCT 

CAGGGCAGCC 

TCA.TCCATAG 

AACTGAATCA 

TCAGAGACAG 

CCGA.GA.TCAA. 

TTTGTTCAGT 

ACCAGCAACA 

GAGACGGAAG 

TTCA.TGGAAG 

GGGATCAGGA 

A-ATTCATACT 

ATCAA.CACA.G 

CAA_AA.CA.ACA 

GACAC-TTTGC- 

GTTAA.TCAGG 

TAGTAGA.ACT 

ATCCTAA.CAA 

GGAGAGATGG 

TGTTCCAGTC 

AT GG .AG AC GG 



GAAATTCTAA 

v_G _ Av_- , _--_ AG G 

CCAAA-AA-AAC 

AG AA T G A T A_A 

AA.TGAAAAGC 

AA.7C-GC T T AT 

CAGATGTTA.A 

.--AA.TA.TGGTG 

AACTGATTGG 

TGC.-_AA_-. TGG 

GGATATCCAT 

TGTTAA.GGCT 

GGTTAGAAGC 

A.GCGGTGATG 

CTTGGT.AA.CA 

ATGATCTGAC 

AGAGATGCAG 

GA.CTAGAATG 

GACTCAAGGC 

TTT.ATATGCA 

CTATCCTGCC 

AGGCCATGCA 

TTCCAACTTG 

A.ATATTAGAG 

AGAA-ACACAT 

ACAGGGGGAT 

T G AA.TCC AG A 

TTCCTTATGC 

ACTACAGAAA 

GCTGAACAAA 

CTGACATCAC 

GCATTCGGAA. 

AGTAA.GAAAA 

GACAAA.TCCA. 

ACAATGTTCA 

GATAAATCAT 

CAGCACCGAC 

GAACCACAA.G 

GAAA.CAA.GCA 

GGCATCACA.C 

AGACTGTACA 

GAGACTATGG 

TGGAACCCAA. 

ATAAGGACTC 

AA.GGTATCAG 

TGATGATGGA. 



700
750
800
850
900
950
1000
1050
1100
1150
1200
1250
1300
1350
1400
1450
1500
1550
1600
1650
1700
1750
1800
1850
1900
1950
2000
2050
2100
2150
2200
2250
2300
2350



Figure 1A 



SEQ ID NO: 35



GCC~ 



G.-\>_ LxCC", A 
GC-CCAAGAAA. 
AAAGGGGTTG 
CTACGAACTC 
G G AGG AC G- AA. 
AA.GAGAA.7CA 
AGAATCAACA 
CAAIGC-GACC 
ACGGA.TGGAA 
AA.GGGCG ATT 
AATTAGACCT 
AACAAIGCAG 
AGGAGTGTCA 
TATTAAGTTT 
CTAATTGAGA 
AAA.T C T T AAA 
CTAGCGGGAG 
AAG AAAGT C A 
CATCCCTGAC 
AGATCAAATC 
TTAGTACCTA 
TAACAACAGC 
TCAA.GCTCTG 
AATGAGGATG 
AAAACCAATA 
CCAATCAACA 
TTAGGATGAA 
CCACAATCTA 
GAGCCGTTAC 
TAGGGTTGTC 
ATGTCTTTTT 
GGGAGCATAA 
TGGATCATTG 
TCCTACAGGC 
GCTACGGAGA 
TCCATGGTCC 
TTGCACTTGC 
GTGATATTTG 
CCCTAAGTCC 
T AC AA G T AC A. 
CAGATTCTAG 
CGGGTTGATC 
AGCAGAAGAT 
GGGATCAGCT 
AAGTCAATTA 
ATCCACACTT 



35 

TCTATCAG 
AGACTAGATG 
GTATCAATCA 
GAAGGCCAAA 
AACTTCAAAG 
T AC AGG AG AA. 
CATCC7GGAA 
AA.CCAAACCC 
AAGCAGAACA 
AGGAAAGAGA 
ACATTATTAC 
ATACCAAGAC 
ATACTGCATC 
ATGGATCATG 
GAAAA.CTGAT 
ATCAAAAAGA 
ATT ATGACAG 
GACATCCATG 
GGTTTGACCC 
CTCTATAGAT 
AGAAATAAAC 
GAAGAATAAG 
AATTTATCAT 
CAAGAGTGAC 
TCAGCTCCCA 
GCACAAAACA 
CATAACAAAG 
AGGACTAATC 
CACATTCCCA 
CACTCAAGGT 
AAGATAGGAG 
ACTGGGCTTC 
GTGATCTAGA 
CCACTTGGGT 
TGCAACCAAG 
TGATAGTTTA 
AGTAGATTAA 
TCCTCAATGT 
TGAACTGCAC 
ATGGCATTGT 
TATCAAAACA. 
ATGAAAAAGG 
AAAA.GGAAGA 
CG AG AAG AT G 
TCCACC-TCAA 
GCATTCAAAA 
AAATTCAGTT 



TAGA . TGA 7 
•■"A-OAA.GAA.GA 

ACACCCCAGA 
AG AC AC AC- AC 
GATCGAAGAA 
CCAACAAGAC 
CATCC7CAAC 
ATCACACAIC 
ACCTCCGAAC 
GG A.C A.C AG A--. 
A.GAA.TC77GG 
AAG AG AG 7 7 G 
AAAG AT AG AC 
ATACCAAA77 
CTTAAAAAGA 
A. C AA T T A I C A. 
AGAGAGGAGG 
ATCAAGACAA 
TCTTATGGAA 
CAATAGAGAA 
AGATTGAATG 
CAGTACAATG 
CAAAAGCAAA 

GAGGAAGTGT 
GTAAACCGCC 
GCCAATCAGA 
ATCTCCAGAT 
AATCCTCCGA 
GAATCCTCTT 
CAATGAACAG 
A.TCCGCCCAA 
TTTGAGATGG 
TGATGATCCA 
TGGCTAGATA 
CTCGATATAG 
CACTGTACAA 
GAAAAGGGAT 
CTTCCACTAG 
AGCAATTGGA 
TATCATTGCC 
GGAGTTCAGA 
T G AAAAAT C A 
T GGGC AG AA7 
AGATTATTAT 
CGCAACTGGC 
GAGAAATCTG 

ATATnnnr.AT 



A C A G i v.- AC AA. 
AAACAA7CA7 
G AG C C AG AAA 

U . l - A^j l 

.-. ^rv_ v_- 'o Au i w o 
AA.c CTwvj v.* i o A 
C AAG G A T C A--. 
GAGAGCACTC 
TG7AATCCAA 
G7G7GTGGC 
- 7 C C 7 AG C AG 
AAA. T C AG AT T 
TGGATGAATC 
CTC-ATCACAT 
GAA.GAAGGAC 
AAGCAAAAGA 
ACACAGGGCA 
AACACCAGAA 
ATGAA.TCCAA. 
AGATCATTAA 
GCAATCATAC 
CTGAGTTGAT 
AACCAAGGGT 
GACCACCCCA 
CATAGATGAT 
AACAATGAGC 
TCTCCGAGAA, 
AGAAAGGCCA 
ACATGGATCC 
AAA.GGTCAAA 
AGTTACAAGG 
CACCGGAAAT 
AAGTAAGAAG 
AACATCAAAC 
GTTATTTGAC 
ATAGAGGGAT 
TCAATAACTC 
TAATACAATA 
CAGATTCCAA 
CTAAATTTCA - 
GTACTCAGTT 
TCTCATTGGG 
TCTATATCAA 
CTATCCCCTA 
CATCAGTTGA 



AA.'— AA.v_ AA.C A 

GGGA-^.TT-AAA. 

G AAT ATTGG A 

ATCCTCAAAG 

A7CCCAGGGG 

GG AA.7CGAA.C 

CA.GAA.CCACA 

GACACAAAAG 

GATTTACAGA 

7C7GCAGCAA 

GAATGTCCTA 

GTTTGATGA7 

CAGAA.CGAGA 

ACATAGAAGA 

CA.TTAA.TCTC 

CAACCAGAAC 

AGAGAAAATA. 

TCGAGAAAAA 

AACGACACAC 

TGCCACTAGA 

TAATAATCAT 

ATCAACGAAC 

GG AC ATGTTC 

CAACACCAAG 

ATACACCAAA 

TAAGAAAAAC 

ATCACCAACT 

TGGCAACATA 

TACCTCATAT 

AGATATCTGG 

AGACAGGTAT 

TTTGTGGCTC 

GAT C AGG AA.C 

AACTGTAAAG 

CTGAACTATA- 

GCTAATAAGG 

AAAATTCAGG 

TATTCAAAAT 

TC AAT AAA.T C 

AGGAGTAGTT 

TGGTTCATCT 

GAATATTGTA 

ATTAGTTGGA 

AGACATTAGC 

ATGGATCTGA 

AATTACAAGG 



2750
2800
2850
2900
2950
3000
3050
3100
3150
3200
3250
3300
3350
3400
3450
3500
3550
3600
3650
3700
3750
3800
3850
3900
3950
4000
4050
4100
4150
4200
4250
4300
4350
4400
4450
4500
4550
4600
4650
4700



Figure 1B



SEQ ID NO: 35
■ 3 . AC- A 7GG AG 7 T C T Z C AC-CC T 7 C A - G G~ 
AAACATCATA GC.W-a.GGGG TCGGGAaIaaT 
CC7CATATCC AACATTGCAA ATCAGGCTAG 
AC7TAGGATC AAA.GGGATCA CCACGAACCC 
CAACACACAA AT C AC AG AC A. AAAA.GGAGAA 
AA-AACAGAA.C GCACACAACC AA.G C AG AG AA. 
CAAACACACC AACAATCCTG CA.iACAA.GC A 
CAAAGAGCAC CAGATATGAC C AT C AC AA.CC 
AATACCCCCA TCATTTTGTC AAA.T AG AC. 1 ~" 
GTGTGTTAGT CAACAATCCT AAAGGCATGA 
ACC-AGATACC TGATATTAAG TTTGATACCC 
ATGTGGGGAT CAACAGATAA ACCAATACAA. 
TAATTCCTCT ATATGATGGA TTAAAATTAC 
AG T C AT G AAA. CCCACAACAA TACTAA7CTT 
AGAGATAATT GGGACAATTG CGATAGGGAT 
CCGCA.GCAGT CGCTCTTGTC GAAGC7AAAC 
AAAC T C AAAG AGGCTATAAG AGACACAAAC 
AAGTTCTGTA GGTAACCTAA TTGTTGCAGT 
TCAACAATGA. AATTATACCT TCAATCACAA 
GGGTTACAAT TGGGAATTGC AT T G AC AC AA. 
TATATTTGGT GATAATATAG GAACACTGAA 
AAGGGATAGC ATCATTATAT CACACAAACA 
TCAACAGTTG ACCAATATGA TATTTATGAC 
CAAGATGAGA GTGATAGATG TTGATTTGAG 
AAGTTAGACT TCCTTTATTA ACTAAACTAT 
GTAGATTCTA TATCATACAA CATCCAGGGC 
TCCCAATCAC ATCATGACAA AAGGGGCTTT 
AAGAATGCAT AGAGGCATTC AGCAGTTATA 
TACATATTAA ATCACGAGAT AGAGAATTGT 
GTGTCCTAAG ACTGTTGTTA CATCAGATGT 
TGAATGGTGG ATTAATTGCA AACTGCATAA 
GG AA.TTGACA ATAGAATTAA TCAATCACCT 
AA.C AC AT AAA GAATGCCAGG TAATAGGTAT 
CTAATAGAGA AGGGACATTA GCAACTTATA 
AATAACTCTG TTGCACTTAA TCCAATTGAT 
GGCAAAACTA GAATTAGAAG AATCGAAGGA 
AAAAGTTAGA TTCCGTTGGA AGTTGGTATC 
ATAATCATAG TGATGATAAT AATTCTAGTT 
TGTAGTCATA ATCAAATTCC ATAGAATTCA 
AAAACAGTGA GCCGTATATA CTGACAAATA 
TCAAATATAA AAAG TA.C AAA AAACTTAGGA 
AGCACCGAAT AGACCAAAAG GCAGCGCAGA 
TGGAATATTG G AAA.C AC AC A AACAGCATAA 
GAAACAGCCA GAGGCAAACA TAGTAGCAAG 
CACCTTCTGG ACAATAACAT TAACAATATT 
T AT T G AC AAA- CTTAATTCAA GAGAACAATC 
G AAAT AAG AA AAGAATTCGC GGCAATAGAC 



o ■ ■ _ . J . _ _ „ 






G A 1 — .-.\-* - 






v- A.*— A^*vj : 


A-^-AT G AAA,-. 


~ _ — „ 


C G G AAAAC AG 


G G .-AAG AAA.G 




G G- G AG T GG AA 


AG G G A.ij A--. 




AAGGGAAAGC 


CCGCCATTCA 




*v- V- AAAA^ A.vj . 


GGTGAAAAGA 




aCAATCATAG 


GGATATTACT 


5100


AACAAAAC7G 


CA-ACGTGTAG 


5150




A-^-ATTTCGA-A 


5200








G - iG 7? -^ r G 




5300




— — * — i • 2* ? i 


^ " = C 
^ — » ^* 






5400


-GCC-C^TC^ 




5450


-G^C-^C^C 


£G a CA T -G a 1 


5500


AAGGCAGTAC 


AATCGATTCA 


5550 


ti a iTCA.GTT 


CAAGACTATG 


5600


G*TT AGGCTG 


TGAAGCAGCA 


5650 


raTT^cTCAG 


A^TTAACAaa 


5700 


ana ana *GGG 


atf AAAATTAC 


5750 


Ta ACGGAaaT 


aTTT anT anT 


5800


CTAT^ 1 ATTC A 


CTGAGTC AAT 


5850


TGATT a CTCA 


A T TACTCTTC 


5900 


CA AATACTCA 


a ATTT AT a AA 


5950 


aa.aG^GTGG' 7, 


a^ATTCCTCT 


6000


T CT a GGTGGT 


GGTGA.TATTA. 


6050


TA^GTCCTTC 


^GATCCAGGT 


6100 


TTA.TCA.GGGA 


A.CATAACACA 


6150


GGTAGGACGA. 


TACGCGTTTG 


6200 


GAACTACATG 


TA.CATGCAAT 


6250 


GaTC aa GGAA 


T T AAG AT C A.T 


6300 


a a arGGAATG 


TTATTCA a TA 


6350 


CATTTGATGA 


CATC AT ATT A 


6400 


ATflTGTATGG 


a aCTCAAC A_A 


6450 

yj *i <j w 




a a a tp a a atp - 






aanaaTnann 




aTaaTnaaTA 


t a ana att at 


o n n 




c a aa ann Ana 


w w ^ U 


nana aTa ana 


nTaTananna 




ana a anTTnT* 


Tna ananAnn 


6750


nnnnananna 


a anTna a a a a 




ATAACACCAA 


CAATGAAACC 


6850


GTTACAAATA 


TCATAATGTA 


6900 


ATCAGTCATT 


TTTATAATGA 


6950 


A.TAATAAATT 


AATGTTGCAG 


7000 


ACCAAGATTC 


AGAGGACTTC 


7050 



Figure 1C 



SEQ ID NO: 35
GG.-.rG AC ATT 
'~ AA. T T C AG AG 
ATGTCAGATG 
T C AAG AAG T G 
7AAJVTCCAAA 
AGTAGTCCTA 
TACT AC AG T A 
TAATCTATGC 
GGGAAATCTT 
CCTAGTACCT 
ATAATAGAAG 
TTATGCTCAA 
TATTGAGGAT 
C^A.CAAGGTT 
TTGTATCCAT 
TCTCGGATAT 
ATACAACTGG 
TATAGCCCAT 
TGATAAAGGC 
TGAGCCAAAA 
AGAATATACA 
AGGGGTAATT 
GGCATAATGT 
TCATGCCCAG 
AAACCCATCG 
CTAGAGAAAA 
GAATTAGCTA 
TTGTATCA.CA 
ATCACAGAAG 
C C AAAAAA.C T 
TTAAAAGAGA 
GAAATAATAA 
GTCCCACAGC 
ATTCTCCTAT 
TTGCCTCAGC 
AC AAAAAAT T 
AATTAAGATC 
TTTATCAGAT 
CGGAATTAAT 



AT AA.T C AA-AT 
AAG T T AGC AT 
TAGCAATATA 
ATCCATTCAA 
AAAGCCAAAA 
AGAAGACCAA 
TAGATAAACA 
ATGTATTGTG- 



35 

GGAACCTC-A. 
TCATGTTCAA 
T GAG AAAA.T T 
C C AA.T AC AG A. 
CAAGTTCTGG 
AGATAAGGTT 
AAIGGCTGTA 
TTACACCTCT 
ACCAAGTACT 
GATTTAAACC 
ATCTTGCTCT 
CACCAAAAGT 
ATTGTACTTG 
T AC AAAT AAT 
CAGTGGGACC 
GGAGGTCTAG 
TTGTCCTGGC 
GGTTCTCAAA 
ATAGATGCAA 
TTATTGGGGA 
TAT AT ACT AG 
GATATTTCTG 
ACTATCACGG 
ACGGATGTAT 
GGGAGTGTTG 
CCCAATCATT 
T AT AT AAC AG 
CATTATGATA 
TTTGAATA.CG 
GCAGCTARAT 
CCACCAGACA 
AAAACTTAGG 
GGCACAACAT 
AGTTAAAGGA 
CCTACGATAT 
AAACTCAATA 
AGTCTTAATG 
ATCCAGAGAT 
AATAAAATAA 
GACTGATGGA. 
C G AAAAAT G A 
TCAAATGTTC 
GACATGGTTT 
ATGAGATTAC 
AAGAATATAT 
AAATTACAAT 
ATGTAGTTGA 



- AC ACT JAGG 
AACTATA7'~C 
TATCAATGA7 
G AA.T G AC T C A 
AGGTGTACAi 
AATACCAGGA 
TTAGAATTTC 
AATCT7AT7A 
ACAAATAGGG 
CCAGAGTCAC 
CTGGCACTAT 
TGATGAAAGA 
ACATTG7CAC 
AAT AT AAC T T 
AG G AAT C TAT 
AGCATGAAC-A 
AAAACACAGA 
TAGGAGAATG 
CTTTTAGCTT 
TCAGAAGGAA 
ATCCACAAGT 
ATTATACTAA 
CCAGGGAATG 
AACAGGAGTT 
TA.TCATCAGT 
ACTTACTCAA 
AACACTTCCA 
AAGGGTATTG 
TTTCAACCTA 
TGATCATCGC 
GACAACACAG 
AGAAAAGTGT 
CTGACATTCT 
AAGATAGCAC 
GGATGATGAT 
AATTAGATAA 
GAAAGAGTAA 
GTCTAGTGAA 
A.TGAATTGCT 
TTAAGAGATC 
TGGAAGTAAT 
AC AT G AC T T A 
ACTATTAAGT 
AT T C AA.T AGG 
TGCTGATACA 
GGGTATATAA 
AGGGAGGTGG 



1 -.-*.•,_. , .-. _ - 

CTAA7AAATA 

Vj A AvjAvj ruT 

C~QG~.-J-.CZC 
CCAGC-77TAT 
AT C G 7 7 AC- T A 
CCC AC-C-GC TC- 
A7AA77AC7A 
AC AT AC AT T T 
TGAAIACAC-A 
7CCGATTATG 
TAATAATGGA 
TTGATAAACC 
TAT AAGG AT A 
AAACC-GAGAC 
GAGACTGTAA 
GTAAACTCTA 
GAGGGTGTGG 
GATTACTTTT 
TGGCACAGTA 
TATAAGAATA 
ATGAATGTCC 
TACACTGATG 
AATTCTTGAT 
CAGCT AC AAA 
GCTGCATATA 
TTTTCATATA 
TGTTATTCAA 
ATATCGGATG 
GAGACGA.TGC 
GCAAGAAAAA 
GTACCCTGAA 
AACTGCATAC 
TCAATACTGA 
AAGACAACGG 
GTGATCTAGG 
ATGTTCCAAT 
AAGTAAA.GCA 
TATGGGTTAC 
TATGATATCA 
TCAATCAGA.C 
ATGACATGAG 
CATAAAGATT 
TCCAGAACTC 
TGACTCCTGA 
AATATAAGTT 



AACACAACAA 7150
a--_aga.c-.-_-.ca 7200
ATAC-AA.CCCC 7250
ATCTCTAACA 7300
7 AGC AA.C AT C 7350
atcaa.tcatc 7400
7c aa.g.at ata 7450
T .-_-_-.T T C GG A 7500
A_AT A.T T G ATG 7550
TC-TTTATCAC- 7600
CATCAACAGG 7650
TTAA7TAT.AA 7700
GTATGCAGCA 7750
AAGTTATATT 7800
GTAATATGTA 7850
TCAGGCTTCT 7900
TTATTGTTGT 7950
ACTATTCCAA 8000
ATTAGGTGAC 8050
AATTACAGTT 8100
AATTGGACTT 8150
ATGGGGTCAT 8200
CATATCCGCT 8250
TCACAAAAGT 8300
TAGAATAAAT 8350
CAACAACAAA 8400
GTAGAAATAA 8450
AACAGAAGTT 8500
CAAGATGACA 8550
AAGATATAAA 8600
TGGACACCGA 8650
TGTCACCTCA 8700
AATAATGAGT 8750
TTATTACTAG 8800
TCAATTAGGA 8850
TAAATATACC 8900
TATGTATACC 8950
AGTAAAACAT 9000
TATACTATCG 9050
ATGAAGATAT 9100
AAATGGTATA 9150
AAGATTACAA 9200
ATAATCTATT 9250
GTCTTAATAT 9300
ATTGGTACTA 9350
CATGTGCAAA 9400



Figure 1D



SEQ ID NO: 35
AT A 7CCT AAG77ACAA7 C.AA.TGTAT 7A 
A_AA.7A_A7AGA 7GGAC7A7TC TCGACCTT AG 
ATATCAC7.A7 7AGAACCAC7 TGCA77.ATCG 
GGTTAAAC.AG C7GAGGGGGG CTTTTTTA.AA 
A_A7TA_A7.A7T TGGAGGTGAG 7 G 7 A G A_AG A.G 
T A7 AT AGA.T A AAA.TTTTAC-A TGTC-TTC.-_AA 
AGGA.GA_A.A77 TTC7CTTTC7 TCCGA.AG777 
CC-AG7A.7AGG A.GCA.GA.GAAA G7TAGAJAA.G7 
T7GA.aji.7TTG ATACTA7CAA. T AAA7GTC A. 7 
TATAAATGGA. TATAGAGAAA GACATGGTGG 
7ACCTGTCGA 7GCACATGAA TTTATCA7.AA. 
GGGATATCA.7 "ATGAGAATGC 7G7AGATTA7 
A_AAA.TTTG.AC AAGTTTATA.G AGCCTCaJ.TT 
ATATGAAAGA TAAAGCATTA TCCCCAAAGA 
TATCCAGCTT CAAACCTGTT ATACCGGA.C7 
A-AC-A77GG77 GAAC-TATTTA T A.GC AGA.T A.G 
T AT T -AG AT T A. CGTAGAATCA GGATATTGGG 
ATCTCATATA GTTTAAAAGA GAAAGAAATA 
TGCAAAAATG ACATACAAGA TGAGGGCTAC 
TATTGGCGAA TAATATAGGG AAATTCTTCC 
GGAGAAATTG AATTACTCAA GAGACTAACA 
TCCGCGGTAT AATG AGGT AT ACAATAA.TTC 
TTCAAGCTTA TAATGCAATT AGCAGTTCCA 
TCAJAA.GAAGT TTGAATTTAA ATCTACAGAT 
AACCGTAAGC TGCTTCTTAA CGACAGATCT 
GG AGGT AT G A ATCAACAGCT TTATTCGGTG 
GGGTTAAAGG AATTATTTAA TTGGCTGCAC 
AATATATGTT GGAGATCCTT ATTGCCCGCC 
CACTTGATGA CCATCCTGAT TCAGGATTTT 
GGAATAGAAG GGTTTTGCCA AAAGTTATGG 
AATACATTTA GCAGCTGTCA AAATCGGTGT 
AAGGGGATAA TCAAGCCATA GCTGTTACCA 
GA.TTATAAAG TTAAGAAAGA GATTGTTTA.T 
TGA.TTCCTTG AGAGAGGTGA TGGATGATCT 
ATGAAACTAT AATAAGTAGT AAAATGTTTA 
TATGACGGAA GAATCCTTCC TCAGGCATTA 
TTTTTGGTCT GAAACAA.TCA TAGATGAG.AC 
TGGCTACATC GT TT GCAAAG GCCATTGAGA 
GGATATGTAT GCTCAATCTT CAAAAATATC 
TGGAATGAAT ATAAACCCAA CTATAACCCA 
TCAGGAATAT TCATTGGATG CAATATGCCT 
GGAGGATTTA ATTATATGGC CATGTCAAGG 
AGATCCTACA GTCGCTGCGT TAGCCGATAT 
ATTTGTTAGA TCGAGGTGTC CTTTACAGAA 
GAGTCTTCTT TTTTAGACTG GGCCTCAGAT 
ACAATCTCAA AATATAACCA CCATGATAAA 
TACTACAGGA CTCACCAAAC CCATTACTA.T 







t z. . 


L- .-. w A- Aw 


A7TTGACATA 




C : CAT . CAAA. 


CTTATGACCC 


» — - - 




7 C AG AA AT G G 


9.-.T} 


AGGAAA.7ACC 


i .-A. iG : A T 


9650


GAA.TCAACAA. 


7AGA7GAAA7 


T j 


, v.- A - — ~ 


C : ^ A T T A G A. G v_- 


} > 2 J 


A7A7G~-~AG 


TGAG.AAATGG 


9800


GC7.A7777TT 


G 7 AG A_AT A_AT 


9850


TCAA.TGGCC7 


GCAGT7ACA7 


9900


AT G C AT Av_ G v_- 


ATC.AAA.TTCT 


9950


TATAAGAGCT 


TCAT.AGGAA7 


10000 


G G AT G AA.G AC 


T7A_-.CTA.TT7 


10050


AATCAAACTG 


GGACACAGTC 


10100


AATGTGTCTC 


ATGATTCACG 


10150 


TAAATTTGAT 


CCCCACCA-AC- 


10200


■TGGATGATCC 


TGAATTTAAT 


10250


AAA.CAAGAAG 


GTAGA.CTTTT 


10300 


ACAAGTATTA 


TCAGAAACA7 


10350


AAGAGAATGG 


GATGGTTAAA 


10400


ACAATATCTA 


TGTCTGGAGT 


10450


AAAAAGTCAC 


ACAGAAGAAC 


10500


ATTTA.TCTTC 


TAATCAGAAG 


10550 


ATATACAATG 


ATGGATACGA 


10600 


TAAAAAATAT 


TGTTTAAATT 


10650 


ATACTTGTAA 


TCAGATATTT 


10700 


CCTCGCCTTG 


AAAAGAGTAC 


10750 


AT C AG AT AT T 


GAACATTTAC 


10800


ATGTTCATAA 


TCCTAAAGGA 


10850


ACACTCATAT 


CTATCAGTGC 


10900 


AAGAGTTACT 


GCAATGGTTC 


10950 


CAAGAGTACC 


T AAT AAT T AT 


11000 


AAAGA.TGTGG 


TAAGATTTTT 


11050 


GGGTCATGAG 


CTCAAACTAA 


11100 


TATATAGCAA 


AAGGATATAC 


11150 


AAAGCATTGT 


CTAGATGTGT- 


11200 


AAGATCAGCA 


TCCTCAAATC 


11250 


ATGGCTACTC 


ACCTGTATTG 


11300 


CAACAGTTGT 


ATATAGCGCT 


11350 


AAA.TATTAAA 


GA.TCA_AT.ATT 


11400 


CCTTAATCCC 


TC-CTAGTGTC 


11450


TGTTTTGTCA 


GAAACATTGG 


11500 


T AAAAG A T T T 


ATAAAAGCAA 


11550 


TTATGAATCA 


AGAACCAGGC 


11600


CCCTATTCAT 


GTAACTTACC 


11650 


GAATATAACT 


GCAAGAAATG 


11700 


CTGGATTATT 


TACAAGTACA 


11750 



Figure 1E



SEQ ID NO: 35

A 7 G A. 7 A G AA.G 
A A TCCTCCC A 
T T AGG AA7GC 
G7AGGGA7AA 
C A J 1 . C T A T G A T 
TAGTCAGTGA 
A7A7CA77AA. 
AAA7GGAC77 
T AA.G AGG AT C 
CCATATACAT 
GACAGGAA7A 
AGAGATC7GA 
AAGGCTGCTA 
CGAAA7ATCT 
TTACATTGGA. 
CTATCAGACA 
ATGAGTTATT 
CTATTAAAGA 
GTAATGTTAA 
GAGTACAGGA 
GTTGTATAAA 
TTAGAGTTAA 
CCCTTTAAAG 
CT7ATACAAT 
ATATCAATA7 
TCGGGATAAT 
ACAGTCTGAT 
ACATTTGGAG 
GAAAATAGAA 
AAGACACCTC 
CCAAAAGTGT 
TGGTCCTAAT 
GCGAGTA.CTC 
CTTGAGATCT 
A.CAAGCATTT 
TAGCATCTTT 
GATGAATTAA 
TAAATATGTG 
7TACGTATGT 
AA.TCCGCC7G 
CTTAGACAAT 
A.G A.G AAA 7 AA 
G T C G T G AAAA 
GAATGTTACT 
CACATCAGC7 
CTTGAATTAT 
AC7C77777A 



35 

AGG A 7 G AGG A 
AGAG77GGAG 
7A7AGC7GG7 
GGAGAGGAGG 
C77G7ACAA.7 
C AA.G AT7 AAG 
G AC A-A AAA 7 
GAAACTCCAG 
7GAACA77G7 
GGATGTATTT 
GCATCATTAA 
AG C AC AA 7 7 A 
7AAGAA7AGC 
7GGA7GGAAG 
7AGC77AAAG 
GG77AAAAGA 
AGAG7AAGCA 
AGCAAATGAA 
CAGGA7TAAG 
CATAACCC7A 
AGAGAG7TAC 
TCAAATACCC 
GA7A7AGA7C 
TGACATGAA7 
GTACTGCAGT 
C77AAGGAGC 
AACTGAAT77 
GG7TAC7CG7 
GGAAGGGATC 
ACATTCAG7A 
TTAAGAGATT 
AC7GC7AG7C 
C77GGA7C7A 
ATATCTG7GA 
C7C7CAAGAC 
7GGACCAAAT 
AACAA7AC77 
CAAGTATCAG 
AAGG AAAAC 7 
AAACGATTGA 
A77G77AAAA 
AAG 7 AG 7 7 AT 
TAAGATCCAT 
ACACA7GGAA 
GAGG77A777 
CACAAA7C77 
GGAGAAGGAG 



ATTA^-G . Z AG 
ATGACATTTT 
ATG77GGA7A 
ATTAACC7A7 
•A . GA.'^.AwA.'wT 
7A7GAAGA7A 
G7GGA7GCAT 
A7CC777AGA 
AGGA7A7G77 
ACCAGGCAA7 
GGG7CGCT7A 
GGG7A7ATCA 
AA7 G A T A 7 A T 
CA7CACAGA7 
A7777GACAC 
7AC7GG7AC7 
GG77CA7CAC 
AC7AAAGA7A 
7G7A777GAA 
7GG7CA7GCA 
AA7GA7GAGC 
TGAGA.GTAAT 
7 A.7 C AAAA.7 7 
TACTGGGATG 
7 AC AAT AGC A 
7GG77G7GA7 
C7GACCC7AG 
GAA7CAA777 
CCA777GGGA 
C77AAAG7A7 
T7GGGA77G7 
AAG A.7 C AA.G 7 
7 7 T A.TG AG A.G 
7AG7GACA7G 
A7C77GCC77 
77A77AAA7C 
AGA7C7GAAC 
GAC7G77AAT 
GCGA77AA.G7 
AGA77GGGA7 
C7G7 AA A.TGA 
T7C7GGGGA7 
AACGAG7GA7 
7GACAC77CC 
GGAG7AAACA 
AA7GAGGGAA 
CAGGAGC7A7 



77GC7AA7GG 

AGA7AA77C7 

C AAC AAAA 7 C 

AAC7TA77AA. 

7AG7AAAA.C7 

7 Z-7GCTC A.G7 

7 T A.7 C AGO A ! ^.- 

G77AC7G7C7 

AT7CAAC7GA 

CTTAATATAG 

CTTTGC-ATCA 

AAAATC7AAG 

ACTTGGGCAT 

TGCACAAACA 

CAG7GACAAC 

CAGATGAAAT 

AATATCTAAT 

CAAATCTTAT 

TATCTATTTA 

TCTACATATA 

ATATCAATCC 

GAA.TTTATA.T 

AATGGTTATA 

ACACAGATAT 

GAT AC AAT GT 

TGCAAATGAT 

ATATACTAGT 

GCATATACCC 

TTATATAATG 

TATCTAATGC 

GGAGTTTTGA 

TAAGCTTGCT 

AATGGTTGAA 

GAAATAGCAA 

TGTGTGTTGT 

TAACATATCT 

ATCAAAGAAG 

TAAATCATTC 

ATCTGAGGAT 

CCCATAGAAG 

CAATTGCAGT 

TAGCTCTAAA 

TCTGAAGTTA. 

TCAGGGAGGA 

GTACAAGTTG 

GTTAAAAAAG 

GTTAGCATGT 



C T T A C 7 G G A A 

AGTAAT7CGA 

G AAAGA7 AAG 

7 7 AAG AC 7 AA 

A G A C C 7 A G C C 

GAAGAA7GA7 

G G A.G 7 AA. 7 AA. 

AGG7GAAA.GG 

GA7CAGC7GA 

G77ACAGA7G 

CAAACCAGC7 

77GGGAA7GA 

CG7GCAAAC7 

A7CAACAAA7 

777C7AG7AC 

GA7AA7A7G7 

77A7CAACAG 

GG77AGAGGA 

G AGG AT GG A7 

GGAG7C7ACA 

A.7 G A.7 AAGG A. . 

AGAGA7CA77 

7G7ACA7GCA 

CGCAGC7AGA 

GA7GA7A77A. 

G777C7CAAA. 

777A7GGA77 

AGAACA7TAA 

AC7A7C7CAT 

A7CC7A777A 

C7C7CGA777 

7GGAGCA7CA 

A7GACAGAAG 

77AGCAGAGA 

A.G AG AG A.C 7 7 * 

A7CC7AC7C7 

CCC7CAAC7G 

7CG7GG7A77 

A7GAGAA.7A.7 

GATAATCAAA 

GAA77A7CAA 

A7GAAGC77C 

AG77A7C7A7 

7C77AAAGC7 

A7 AAAG A.7 AG 

7A7GA7GC7A 



12150
12200
12250
12300
12350
12400
12450
12500
12550
12600
12650
12700
12750
12800
12850
12900
12950
13000
13050
13100
13150
13200
13250
13300
13350
13400
13450
13500
13550
13600
13650
13700
13750
13800
13850
13900
13950
14000
14050
14100



Figure 1F



SEQ ID NO: 35 

CAC7CGGTCC TGCAATAAAT TA77A 7AA 77 ."--"TTAAA. 7A77ACAJA7 

GTAAT7GG7C AACGGGAATT AAAAATCTT" G 7A7CAGAAG 7A7CA7TAG7 14200
AGGT.AAJVAAA. CTAGGAAATG T AA.C AC AG A 7 7CTTAATCCC GTGA.GGG7G7 14250
TATTTA.ATGG GAATCCCAA7 TCAACA.7GGA. TAGGAAA7A7 GG AAT G T G AG 14300
Av_.lTTA.AT AT GGAGTGAATT AA-AIC-ATAAC- TCAA7TGG7T TAGTACATTG 14350
TGACATC-GAG GGA.GGG AT AG GC AAA.TC AG A AGAAAC7G77 C7ACATGAAC 14400
ATTATAG.AT TATTAGGATT ACATA7T7AA TCGGGGATC-A TGATGTTGTC 14450
CTAGTATCAA AAATTATACC AACTATTACT CCGAATTGGT GTAAAATAGT 14500
CTATCTATAC AAGTTGTATT GGAAGGA7GT AA.GTGTAGTG TCCCTTAAAA 14550
CATCCAATCC TGCCTCAACA GAGGTTTATT TAATTTCAAA AGATC-CTTAC 14600
TC-TACTGTAA TGGA_ACCCAG TAATCTTGTT TTATC.iAA.i-C T T AAAAG G A. T 14650
ATCATCAATA GAAGAAAATA AT C T AT T AAA GTGGATAATC T T ATC A_AA_AA 14700
S G i" Z * 5 J" 1 " T CGAGTGGTT -^ CAGCATGAAA TCAAAGAAGC- A.G AAA.GG G AT 14750
; ATGGGATAA TGAGGCCjvr - a - TCATACAGCA CTGCAAATTT TTGGATTCCA 14800
A_ATT.AA.CTTA AATCACTTAG CTAGAGAATT TTTATCAACT CCTG.ATTTAA. 14850
CC """iS* a " TT " aj1 ' TAATA TAATT CAAAGTTTTA CAAGAACAAT TAAAC-ATGTT 14900
ATGTTCGAAT GGGTCAATAT CACTCATGAC AATAAAAGAC AT.AAATTAGG 14950
AGGAAGATAT AATCTATTCC CGCTTAAAAA, TAAGGGGAAA TTAAGATTAT 15000
TATCACGAAG ATT AGT ACTA AGCTGGATAT CATTATCCTT ATCAA.CCAGA 15050
TTACTGACGG GCCGTTTTCC AGATGAAAAA. TTTGAAAA.TA GGGCACAGAC 15100
CGGATATGTA TCATTGGCTG ATATTGATTT AG AA.T CCT T A AAGTTATTAT 15150
CAA.GAAATAT TGTCAAAAAT TACAAA.GAAC ACATAGGATT AATATCATAC 15200
TGGTTTTTGA CCAAAGAGGT CAAAATACTA ATGAAGCTTA TAGGAGGAGT 15250
CAAACTACTA GGAATTCCTA AACAGTACAA AGAGTT AC-AG GATCGATCAT 15300
CTCAGGGTTA TGAATATGAT AATGAATTTG ATATTGATTA ATACATAAAA 15350
ACATAAAATA AAACACCTAT TCCTCACCCA TTCACTTCCA ACAAAA.TGAA. 15400
A_AGTAAGAAA AACATGTAAT ATATATATAC CAAACAGAGT TTTTCTCTTG 15450

TTTGGT 15 ^ 55 



Figure 1G 



SEQ ID NO: 36
A < _ k_ AAA. ^_ A_A. G 
AA.^, . Av-*^* AT 
CACTG7AA7C 
AGAACA.7AAC 
AC7G7G7C7A 
AA7C-ACA77C- 
A ^ A . Gv-'c C A 
C-CCAACCCAC- 
ATATGTCATC 
GGTTTGTCGT 
ATGTTTGGGA 
7AGAAGCA.C7 
CG7G7C77C-G 
A7AACCAG7A 
A.TTTCGA.CAA 
CAGTAGAACA. 
CTCATGGTTG 
AA.CAATA.GAA 
GTCTTGCTTC 
GCAGCTCTAA 
ACTGATAGAG 
TTTTGAGAGA 
CTCTGGAGTT 
A.CAGTATGTA 
GTCAA.GCAGT 
GATGAACTGG 
GAAGAACATC 
CAGCCATAGA 
GGAGACCAAG 
ATGGGCAGAC 
TTGA.CAGCAT 
AGA.CTCAACG 
AAACAACACA 
GC AACTAGTC 
ACTTAGGATT 
GAATCCAACC 
AAACAATCAA 
CTGACATCTC 
TCCCAA.GA.GA 
ACTTA.GCACG 
AGGAAAATAG 
GAACGTGCCA 
GGGA.GGATAT 
TCACTCGAGG 
A7CCAGGAAG 
TACTAAGAGG 
GAAGTGATGC 



36 

AG AAGAG AC 7 
7AAAGAAC77 
A7G77GAG7C 
AAAA 7 C A.GC 7 
7A777GC7C7 

AAGAC-C7GGA 
AA77A7ATT7 
TACA7GATAG 
CAAGAC7AGA 
C-7C-A7C77C-A 
7C7ACAA7CG 
AGCCC77A7A 
7A7CA.GGATT 
GATGGAACA.G 
AA77GGA7CA 
AAACAC7GA7 
AAGAATATAC 
A7TTTTCAAC 
CTCTGTCTAC 
CTA.TATCTA.T 
TCC7GTGCAT 
A7GCGATGGG 
ACAGGAAGGT 
GGCACG7GAC 
GGGTCACA.CA 
AGCAGTTCAG 
AATGGCAATA 
ACCAAGGAGA 
GAAACCGGGA 
CAAAACTGAA 
AGAAAAGGAA 
AATCAAACTG 
ACAAAGAGAT 
AATGGAAATT 
ACAACTCAAT 
ATCATGGATT 
ATCGGCCCTC 
ACACGGCAGA 
ACAA7C7ACC 
TGGACCAGCT 
CAGAGACAAA 
AGGAGAGGA_A 
AA.7C7CCAGA 
ATATTGATTA 
GAAATGCGAC 
CATTCC7CCA 



~r.. 



Avj>_- 
* A.7 7 C G AC AC 

7GGACCA7CA 

- - - i v- A 

77777AC-777 

AA.C A7C AAA 7 

AGAAAGACCC 

GAGA7GG777 

G7A7GA7CAA 

AGGA7C77G7 

A7CCAGG777 

GA.GGAAAGGA 

TTAAA7CCAG 

A77A7GAGG7 

AACAATGAAC 

AGATTGTAGG 

ACAATCAGAT 

CCTTAGACCG 

CAAAGGGC-CC 

GGTGAGTTTG 

TGTAGCAGTT 

CCTATCTGGA 

GCCGAGTCGC 

AGAAGCCAAG 

ATACAACCTT 

GATGAGGAAG 

TGAACC7CGG 

ATGACAACCA 

CAAAGAAACA 

ACAGAGTAAC 



. A A. G w .- A_A 
A . 7L. A^- *. o 

a:a-.caga7C- 

77C777AGAC 

C777C-77A7C 

GG7AG7AATG 

AC-GAAGACAG 

ATGAAAAGAC 

GACAA7A7G7 

7CA7AC7777 

GGATAA7AC7 

77C777AC7C 

7C7AG7G77G 

CCCAACAGAG 

ACAGC-CAGGA 

AAAC7A.CA.TC 

ATGGCATTGA 

GA.CA7CAACA 

ACGTGCTCCT 

CACCAGGCAA 

G7ACAAAACA 

7ATTGAAATG 

AGATGAGTTC 

CAAAGC7TGA 

CTATAAGCCT 

CAGAGCAGCC 

TCATCCATAG 

AACT GAAT C A. 

TCAGAGACAG 

CCGGGATCAA 



L? ------- - v- . 

CG i A'_- ^ AG G 
GCAA^A-AAA.C 
ACA.ATGACA.A 



AA.1 
— — ■ . » 



AAAAGC 



AAATAGA.TGA TTTATTCA.GT 
GACCACCATC ATCAGCAACA 



ATCCAATCCG 
CAACCAAAGA 
CTTGGGAAGA 
GACATCATTG 
CAGCAATGAA 
AA.CT7GAG7C 
AACAAAAA7C 
AGATAGAAAT 
GCAGCCCAGA 
AGCAGCCCAG 
CAATGAAG77 
AATTTAAAGA 
ACAAAACAAG 



GAGACGGAAG 
T7CATGGAAG 
GGGA7CAGGA 
AATTC AT ACT 
ATCAACACAG 
CAAAACAA.CA. 
GACAG77TGG 
GTTAA.TCAGA 
TAGTAGAACT. 
ATCCTAACAA 
GG AG AG AT GG 
TGTTCCAGTC 
ATGGAGACGG 



77A7 
CAGA7G77AA 
AAATA7GG7G 
AAC7GAC7GG 
TGCAAAATC-G 
G G A.7 A.' i C C A. 7 
7G7TAAC-GC7 
GG7TAGAAGC 
AGCGG7GATG 
CTTGG7AACA 
ATGACCTGAC 
AGAGATGCAG 
GACTA.GAATG 
GACTCAAGGC 
TTTATATGCA 
CTATCCTGCC 
AGGCCATGCA 
TTCCAACTGG 
AA.TATTAGAG 
AGAAACACAT 
ACAGGGGGAT 
CGAATCCAGA 
TTCCTTATGC 
ACCACAGAAA 
GCTGAACAAA 
CTGACATCAC 
GCATTCGGAA 
AGTAAGAAAA 
G ACAAAT CC A 
ACAATGTTCA 
GATAAATCAT 
CAACACCGAC 
GAGCCACAAG 
GAAACAAGCA 
GGCATCACAC 
AGACTGTACA 
GAGACTATGG 
TGGAACCCAA 
ATAAGGACTC 
AAGGTATCAG 
TGATGATGGA 



4 30 
300 



700
750
800
850
900
950
1000
1050
1100
1150
1200
1250
1300
1350
1400
1450
1500
1550
1600
1650
1700
1750
1800
1850
1900
1950
2000
2050
2100
2150
2200
2250
2300
2350



Figure 2A 



SEQ ID NO: 36 



A G AG GCCTGG 
GACTGCCGCA 
C-GCC.AAGAAA. 
A_AA.GGGA.G7G 
CTA.CGAACTC 
CCAC-CA.CC-AA. 
AAGAGAATCA 
A.G AA.T C AA.C A. 
CA.ATGGGACC 
ACGGA.7GGAA 
AA.GGGCGATT 
AA.TTA.GA.CCT 
AAC AA. T GC AG 
AGGAGTGTCA 
TATTAAGTTT 
CTAA.TTGAGA 
AAA.TCTTAAA 
CTAGCGGGAG 
AAGAA_a.GTCA 
CATCCCTGAC 
AGATCAAATC 
TTAGTA.CCTA 
CAACAACAGC 
TCAAGCTCTG 
AATGAGGATG 
AAAACCAACA 
CCAATCAACA 
TTAGGATGAA 
CCA.CAATCTA 
GAGCCGTTAC 
TAGGGTTGTC 
• A.TGTCTTTTT 
GGG AGC AT AA 
TGGATCATTG 
TCCTACAGGC 
GCTACGGAGA 
TCCATGGTCC 
TTGCACTTGC 
GTGATATTTG 
CCCCAAGTCC 
TACAA.GTACA 
CAGATTCTAG 
CGGGTTGATC 
AGCAGAAGAT 
GGGATCAGCT 
AAGTCAATTA 
ATCCACACTT 



AA7C7A.7CA.G 
ACACTAGATG 
G7A7CAA7CA 
GAAGGCCAAA 
AA.CTCCA_AA.C- 
7ACAGGAC-AA. 
CATCCTGGAA 
.AA.CCAAA.CCC 
AA.C-CAGAACA 
AGGAAAGAGA. 
AC AT T AT T AC 
AT AC C AA.G AC 
ATACTGCATC 
ATGGATCATG 
AAAAACTGAT 
ATCAAAAA.GA 
ATCATGACAG 
GACATCCATG 
GGTTTGACCC 
CTCTACAGAT 
AGAAATAAAC 
GAAGAATAAG 
AA.TTTATCAT 
CAAGAGTGAT 
TCAGCTCCCA 
GCACAAAACA 
CAT AAC AAAG 
AGGACTGATC 
CACATTCCCA 
CACTCAAGGT 
AAGATAGGAG 
ACTGGGCTTC 
GTGATCTAGA 
CCACTTGGGT 
TGCAACCAAG 
TGATAGTTTA 
AGTAGATTAA 
TCCTCAATGT 
TGAACTGCAC 
ATGGCATTGT 
TATCAAAACA 
ATGAAAAAGG 
AAAAGGAAGA 
TGAGAAGATG 
TCCACGTCAA 
GCATTTAAAA 
AAATTTAGTT 



TACATCT3A7 

v_j AA.V-* AAG A 

ACACCCCAGA 
AG AC AC AG AC 
G A T C G AAG AA 
CCAi.CA-.GA7 

catcc7ca-.c 
atcagacatc 
acc7cagaac 
g g ac ac ag aa 
agaa.tc77gg 
aagagag7tg 
aaa.gatagac 
at g t c aaa.t t 
c t t aag .aag a. 
acaa7tatca 
agagaggagg 
at c aag a.c aa. 
tcttatggaa 
caatagagaa 
agattgaatg 
ca.gtacaatg 
caaaagcaaa 
gaggaagtgt 
gtaaaccgcc 
gccaataa.ga. 
atctttagat 
aatcctccaa. 
gaatcctctt 
caatgaacag 
atccgcccaa. 

TTTGAAA.TGG 
TGATGATCCA 
TGGCTAGATA 
CTCGATATAG 
CACTGTGCAA 
GAAAA.GGGA.T 
CTTCCACTAG 
AGCAATTGGA 
TA.TCATTGCC 
GGAATTCAGA 
TGAAAAA.TCA 
TGGGTAGAAT 
AG AT TAT TAT 
CGCAACTGGC 
GAGAAATCTG 
ATATGGGCAT 



ACTCCTTATG 
A Av-r T G AC AA 
AAACA,-.7CAC 
G AGC C AG AAA. 
CA.CAC-A.C-7C-G 



AGCGAGAGCG 
AA.7C7CC-GGA. 
CAA.GGA.CCAA. 
C-AC-AGCAC7C 
TG7AA.7CCAA. 
7G7G7G7GGC 
TTCC7AGCAG 
AAA.TCAGA.TT 
TGGATGAA.TC 
CTGA7CACAT 
GAAGAA.GGAC 
AGGCAAAAGA 
ACACAGGGCA 
AACACCAGAA 
ATGAATCCAA 
AGATCACTAA 
GCAATCATAC 
CTGAGTTGAT 
AACCAAGGGT 
GACCATCCCA 
CATAGATGAC 
AACAATGAGC 
TCTCCGAGAA 
AGAAAGGCCA 
ACATGGATCC 
AAAGGTCAAA 
AGTTACAAGG 
CACTGGAAAT 
AAGTAAGAAG 
AACATCAAAC 
GTTATTTGAC 
ATAGAGGGAT 
TCAATAACTC 
T AA. T A.C AA T A. 
CAGATTCCAA 
CTAAATTTCA 
GTACTCAGTT 
TCTCATTGGG 
TCTATATCAA 
CTATCCCCTA 
CATCAGTTGA 



-AA.G A_AC AAC A 

GGGAA77AAA. 

C AA.T A.T7GC- A. 

A7CCTCAAAG 

A. 7 C C C A.G G G G 

GC AA.T CG AGC 

CA.GAACCA.CA. 

G AC AC AAAA.G 

GA.TTT A.C A.G A. 

TCTGCA.C-CAA 

GAA.TGTCCTA. 

GTTTGATGAT 

CAGAACGAGA 

ACATAGAAGA 

CA.TTA_-.TCTC 

CAACCAGAAC 

AGAGAGAATA 

TCGAGAAAAA 

AACGACACAC 

TGCCACTAGA 

TAATAATCAT 

ATCAACGAAC 

GGACATGTTC 

CAACACCAAG 

ACACACCGAA 

TAAGAAAAAC 

ATCACCAGCT 

TGGCAACATA 

TACCTCATAT 

AGATATCTGG 

AGACAGGTAT 

TTTGTGGCTC 

GATCAGGAAC 

AACTGTAAAG 

CTGAACTATA- 

GCTAACAAGG 

AAAATTCAGG 

TA.TTCAAAAT 

TCAATAAATC 

AGGAGTAGTT 

TGGTTCATCT 

GAATATTGTA 

ATTAGTTGGA 

A.G A.C AT T A.GC 

ATGGATCTGA 

AATTACAAGA 



2 5-3-! 



_-(.•> 
2750
2800
2850
2900
2950
3000
3050
3100
3150
3200
3250
3300
3350
3400
3450
3500
3550
3600
3650
3700
3750
3800
3850
3900
3950
4000
4050
4100
4150
4200
4250
4300
4350
4400
4450
4500
4550
4600
4650
4700



Figure 2B 



SEQ ID NO: 36 



GTAGA-.GCAA 7TC.CCAGCC TTCATTACCT GGCGAA.rTCA GAT ACT AC ZZ 4 " i : 

AAACATCAiA GCAA.AAGGGG TCC<PGAA_AAT CAGA.CAG7AA A_ACCA.ACA.AC '.. 4 -: V 
CCTGACATCC AACACTGCAA ATCAGGCTAC CCACAGC-AGA AA-AATCA_AAA ' 

ACTTAGGATC AAAGGGATCA CCACAAACCC CGGGAAACAG CC.A.AACC.A.AC 4900
CAACACACAA AT C .AC AG AC A. A.A.AAGG.A.AAA GGCAC7GCAA AGACCGAG.AA 4950
C.AA.GCAGAAC GCACACAACC AA.GC AC- AGG A AAGCCAAAGC CCGCCA77CA 5000
CA_A.ACA.CACC AACAATCCTA C.AAACAAGCA CCA_A.A-.rAGA GGTC.AAAAGA 5050
CAAAGAGCAT CAGATATGAC CATCACAACC ATAATCATAG CC AT ACT ACT 5100
.AA.TACCCCTA TCATTCTGTC ' AAAT AC-AC AT AAC.AAAACTC- CAACGTGTAC- 5150
GTC-TATTAGT CAA.CAATCCC A.AA.GGCATGA. .AAATTTCACA .AAA.TTTTGAA 5200
ACGAGATACC TGATATTAAG TCTGATACCC AAAA.7 AG AG A ATTCACACTC 5250
ATGTGGGGAT CA_AC.AGAT.AA. ACCAATACAA C-.A.AGTTATTG GATAGATTGA 5300
T.AATTCCTCT ATATGATGGA TT.AA_A.ATT AC A_AA_-_-.G.ATGT A_ATA.GT.AGT A. 5350
AGTCATG.AAA. CCCAT.AAT.AA. TACTAATCTT AGGAC.AA_A.AC GATTCTTTGG 5400
AGAGATAATT GGGAC.AATTG CGATAGGGAT AGCCACCTCA GCGCA.AA.TCA 5450
CCGCAGCAGT CGCTCTTGTC GA_AGCT.AA_AC AGGC.AAGGTC AGACATAGAA 5500
.AAACTCAAA.G AAGCTATAAG AGACACAAAC AA.GGC .AGT AC AA.TCGATTCA 5550
AAGTTCTGTA GGTAACCTAA TTGTTGCAGT T.AAA.TCAGTT C AA.GACT ATG 5600
TCAACAATGA AATTGTACCT TCAATCACAA GATTAGGCTG TGAAC-CAGCA 5650
GGGTTACAAT TGGGAATTGC ACTGACACAA CATTACTCAG AATTAACAAA. 5700
TATATTTGGT GAT AAT AT AG GAACACTGAA AGAAAAAGGG AT AAAAT T AC 5750
AGGGGATAGC ATCGTTATAT CATACAAACA TAACAGAAAT ATTTACTACT 5800
TCAACAGTTG ACCAATATGA TATTTATGAC CTATTATTCA CTGAATCAAT 5850
CAAGATGAGA GTGATAGATG TTGATTTGAG TGATTACTCA ATTACTCTTC 5900
AAGTTAGACT TCCTTTATTA ACTAAACTAT CAAATACTCA GATTTATAAA. 5950
GTAGATTCTA TATCATACAA CATCCAGGGC AAAGAGTGGT ATATTCCTCT 6000
TCCCAATCAC ATCATGACAA. AAGGGGCTTT TCTAGGTC-GT GCTGATATTA 6050
AAGAATGCAT AGAGGCATTC AGCAGTTATA TATGTCCTTC TGATCCAGGT 6100
TATATATTAA ATCACGAGAT AG AG AAT T GT TTATCAGC-GA ACATAACACA 6150
GTGTCCTAAG ACTGTTGTTA CATCAGATGT GGTACCACGA TACGCGTTTG 6200
TGAATGGTGG ATTAATTGCA AACTGCATAA- CAACTACATG TACATGCAAT 6250
GGAATTGACA ATAGAATTAA TCAATCACCT GATCAAGGAA TTAAGATCAT 6300
AACACATAAA GAATGCCAGG TAATAGGTAT AAA.CGG.AATG TTATTCAA.TA 6350
CTAATAGAGA AGGGACATTA GCAACTTATA CATTTGATGA CATTATATTA 6400
AATAACTCTG TTGCACTTAA TCCAATTGAT ATATCTATGG AA.CTTAA.CAA 6450
GGCAAAACTA GAATTAGAAG AATCGAAGGA ATGGATAAAG AAATC AAATC 6500
AAAAG T T AG A TTCCGTTGGA AGTTGGTATC AATCTAGTGC AACAATCACC 6550
ATAATCATAG TGATGATAAT AATTCTATTT ATAATCAATA TAACAATTAT 6600
TGTAGTCATA ATCAAATTCT ATAGAATTAA GGGGGAAAAT CAAAACGACA 6650
AAAA.C .AG T G A GCCGTATATA CTGAC.AAATA GACAATAA.G.A CT.ATA.CACGA 6700
T C AAAT A.T AG AAA.GTACAAA AAACTTAGGA ACAAAGTTGT TCAACACAGC 6750
AGCAGCGAA.C AGACCCAAA.G GCAGCGCAGA GGCGACACCG A-ACCCAAAAA 6800
TGGAATATTG GAAACACACA AACAGCACAA AAAACACCAA. CAA.TGAAACC 6850
GA_AACAA.CC A. GAGGCAAACA CAGTAGCAAG GTTACAAATA TCATAATGTA 6900
CACCTTCTGG ACAATAACAT . CAACAATATT ATTAGTCATT TTTATAA.TGA. 6950
TATTGACAAA CTTAATTCAA GAGAACAATC .AT AAT AAAT T AATGTTGCAG 7000
GAAATAAGAA AAGAATTCGC GGCAATAGAC ACCAAGATTC AGAGGACCTC 7050



Figure 2C 



SEQ ID NO: 36

GGATGACATT GGAACCTCAA TACAGT-:-,_-^-.- .-ATAAATACA AGACTTCTCA 7100

CAA~TCAC-AG TCATGTTCAA AACTATATCC CACTATCACT AACACAA'_AA 7150

ATGTCAGATC TCAGAAAATT TAIC.^TGAT CTAACA-ATA AAAGAGAACA 7200

TCAAGAAGTG CCAATACAGA GAATGACTCA TGATAGAGGT ATAGAACCCC 7250

TAAATCCAGA CAAGTTCTGG AGGTGTAGAT CTGGTA^.CCC ATCTCTAACA 7300

AGTAGTCCTA AGATAAGGTT AATACCAGGG CCAGGTTTAT TAGCAACATC 7350

TAGTAGAGTA AATGGCTGTA TTAGAATGCG ATCGTTA:-* ATCAATCATT 7400

TAATCTACGC TTACACCTCT AATCTTATCA CCCAGGGCTG TCAAAATATA 7450

GGGAAATCTT ACCAAGTACT ACAAATAGGG ATAATTACTA TAAATTCGGA 7500

CGTAGTACCT GATTTAAATC CCAGAGTCAC ACATACATTT AATATTGATG 7550

ATAATAGGAA ATCTTGCTCT CTGGGACTAT TGAATACAGA TGTTTATCAG 7600

TTATGCTCAA CACCAAAAGT TGATGAGAGA TCCGATTATG CATCAACAGG 7650

TATTGAGGAT ATTGTACTTG ACATTGTCAC TAATAATGGA TTAATTATAA 7700

CAACAAGGTT TACAAATAAT AATATAACTT TTGATAAACC GTATGCAGCA 7750

TTGTATCCAT CAGTAGGACC AGGAATCTAT TATAAGGGTA AAGTTATATT 7800

TCTCGGATAT GGAGGTCTAG AGCATGAAGA AAACGGAGAC GTAATATGTA 7850

ATACAACTGG TTGTCCTGGC AAAACACAGA GAGACTGTAA TCAGGCTTCT 7900

TATAGCCCAT GGTTCTCAAA TAGGAGAATG GTAAACTCTA TTATTGTTGT 7950

TGATAAAGGC ATAGATGCAA CTTTTAGCTT GAGGGTGTGG ACTATTCCAA 8000

TGAGCCAAAA TTATTGGGGA TCACAAGGAA GATTACTTTT ATTAGGTGAC 8050

AGAATATACA TATATACTAG ATCCACAAGT TGGCACAGTA AATTACAGTT 8100

AGGGGTAATT GATATTTCTG ATTATAATAA TATAAGAATA AATTGGACTT 8150

GGCATAATGT ACTATCACGG CCAGGAAATG ATGAATGTCC ATGGGGTCAT 8200

TCATGCCCAG ACGGATGTAT AACAGGAGTT TACACTGATG CATATCCGCT 8250

AAACCCATCG GGGAGTGTTG TATCATCAGT AATTCTTGAC TCACAAAAGT 8300

CTAGAGAAAA CCCAATCATT ACCTACTCAA CAGCTACAAA TAGAATAAAT 8350

GAATTAGCTA TATATAACAG AACACTTCCA GCTGCATATA CAACAACAAA 8400

TTGTATCACA CATTATGATA AAGGGTATTG TTTTCATATA GTAGAAATAA 8450

ATCACAGAAG TTTGAATACG TTTCAACCTA TGTTATTCAA AACAGAAGTT 8500

CCAAAAAACT GCAGCTAAAT TGATCATCGC ATATCGGATG CCAGATGACA 8550

TTAAAAGAGA CCACCAGACA GACAACACAG GAGATGATGC AAGATATAAA 8600

GC-AATAATAA AAAACTTAGG AGAAAAGTGT GCAAGAAAAA TGGACACTGA 8650

ATCCCACAGC GGCACAACAT CTGACATTCT GTACCCTGAA TGTCACCTCA 8700

ATTCTCCTAT AGTTAAAGGA AAAATAGCAC AACTGCATAC AATAATGAGT 8750

TTGCCCCAAC CCTACGATAT GGATGATGAT TCAATACTGA TTATTACTAG 8800

ACAAAAAATC AAACTCAATA AATTAGATAA AAGACAACGG TCAATTAGGA 8850

AATTAAGATC AGTCTTAATG GAAAGAGTAA ATGATCTTGG TAAATACACC 8900

TTTATCAGAT ATCCAGAAAT GTCTAGTGAA ATGTTCCAAT TATGTATACC 8950

CGGAATTAAT AATAAAATAA ATGAATTGCT AAGTAAAGCA AGTAAAACAT 9000

ATAATCAAAT GACTGATGGA TTAAGAGATC TATGGGTTAC TGTACTATCG 9050

AAGTTAGCAT CGAAAAATGA TGGAAGTAAT TATGATATCA ATGAAGATAT 9100

TAGCAATATA TCAAATGTTC ACATGACTTA CCAATCAGAC AAATGGTATA 9150

ATCCATTCAA GACATGGTTT ACTATTAAGT ATGACATGAG GAGATTACAA 9200

AAAGCCAAAA ATGAGATTAC ATTCAATAGG CATAAAGATT ATAATCTATT 9250

AGAAGACCAA AAGAATATAT TGCTGATACA TCCAGAACTC GTCTTAATAT 9300

TAGATAAACA AAATTACAAT GGGTATATAA TGACTCCTGA ATTGGTACTA 9350

ATGTATTGTG ATGTAGTTGA AGGGAGGTGG AATATAAGTT CATGTGCAAA 9400


Figure 2D 



SEQ ID NO: 36

ATTA.TCCT AAATTACAAT CAATC-TATTA .AAA"_:vj.AA'_ AAlTTATGGG 9450

AAATAATAGA TGGACTATTC CTGACCTTAG GAGAAAGAAC ATTTGACATA 9500

ATATCACTAT TAGAACCGCT TGCATTATCC- CTCATTCAAA CTCATGACCC 9550

GGTTAAACAG CTCAGAGGGG CTTTTTTAAA TCAGGTGTTA TCAGAAATGG 9600

AATCAATATT CGCAGCTGAG TGTACAACAG AC^-AAATACC TAATGTGGAT 9650

TATATAGATA AAATTTTAGA TGTATTCAAA GAATCAACAA TAGATGAAAT 9700

AGCAGAAATT TTCTCTTTCT TCCGAACTTT lGGA.CA.l.v-C- CCATTAGAGG 9750

CGAGTATAGC AGCAGAGAAA GTTAGAAAGT ATATGTACAC TGAGAAATGT 9800

TTGAAATTTG ATACTATCAA TAAATGTCAT GCTATTTTTT GTACAATAAT 9850

TATAAATGGA TATAGAGAAA GACATGGTGG TCAATGGCCT CCAGTTACAT 9900

TACCTATTCA TGCACATGAA TTTATCATAA ATGCGTACGG ATCAAATTCT 9950

GCCATATCAT ATGAAAATGC TGTAGATTAT TATAAGAGCT TCATAGGAAT 10000

AAAATTTGAG AAGTTTATAG AGCCTCAATT GGATGAAGAC TTAACTATTT 10050

ATATGAAAGA TAAAGCATTA TCCCCAAAGA AATCTAACTG GGACACAGTC 10100

TATCCAGGTT CAAACCTGTT ATACCGCACT AATGTGTCTC ATGATTCACG 10150

AAGATTGGTT GAAGTATTTA TAGCAGATAG TAAATTTGAT CCCCACCAAG 10200

TATTAGATTA CGTAGAATCA GGATATTGGC TAGATGATCC TGAATTTAAT 10250

ATCTCATATA GTTTAAAAGA GAAAGAAATA AAACAAGAAG GTAGACTTTT 10300

TGCAAAAATG ACATACAAGA TGAGAGCTAC ACAAGTATTA TCAGAAACAT 10350

TATTGGCGAA TAATATAGGG AAATTCTTCC AAGAGAATGG GATGGTTAAA 10400

GGAGAAATTG AATTACTCAA GAGACTGACA ACAATATCTA TGTCTGGGGT 10450

TCCGCGGTAT AATGAGGTAT ACAATAATTC AAAAAGTCAC ACAGAGGAAC 10500

TTCAAGCTTA TAATGCAATT AGCAGTTCCA ATTTATCTTC TAATCAGAAG 10550

TCAAAGAAGT TTGAATTTAA ATCAACAGAT ATATACAATG ATGGATACGA 10600

AACCGTAAGC TGCTTCTTAA CGACAGATCT TAAAAAATAT TGTTTAAATT 10650

GGAGGTATGA ATCAACAGCT TTATTCGGTG ATACTTGTAA TCAGATATTT 10700

GGGTTAAAGG AATTATTTAA TTGGCTGCAC CCTCGGCTTG AAAAGAGTAC 10750

AATATATGTT GGAGATCCTT ATTGCCCGCC ATCAGATATT GAACATTTAC 10800

CACTTGATGA CCATCCTGAT TCAGGATTTT ATGTTCATAA TCCTAAAGGA 10850

GGAATAGAAG GGTTTTGCCA AAAGTTATGG ACACTCATAT CTATCAGTGC 10900

CATACATTTA GCAGCTGTCA AAATCGGTGT AAGAGTTACT GCAATGGTTC 10950

AAGGGGATAA TCAAGCCATA GCTGTTACCA CCAGAGTACC TAATAATTAT 11000

GATTATAAGG TTAAGAAAGA GATTGTTTAT AAAGATGTGG TAAGATTTTT 11050

TGATTCTTTG AGAGAGGTTA TGGATGATCT GGGTCATGAG CTCAAACTAA 11100

ATGAAACTAT AATAAGTAGT AAAATGTTTA TATATAGCAA AAGGATATAC 11150

TATGACGGAA GAATCCTTCC TCAGGCGTTA AAAGCATTGT CTAGATGTGT 11200

TTTTTGGTCT GAAACAATCA TAGATGAGAC AAGATCAGCA TCCTCAAATC 11250

TGGCGACATC GTTTGCAAAG GCCATTGAGA ATGGCTACTC ACCTGTATTG 11300

GGATATGTAT GCTCAATCTT CAAAAATATC CAACAGTTGT ATATAGCACT 11350

TGGAATGAAT ATAAATCCAA CTATAACCCA AAATATTAAA GATCAATATT 11400

TCAGGAATAT TCATTGGATG CAATATGCAT CTCTAATCCC TGCTAGTGTC 11450

GGAGGATTTA ATTATATGGC CATGTCAAGG TGTTTTGTCA GAAACATTGG 11500

AGATCCTACA GTCGCTGCAT TAGCTGATAT TAAAAGATTT ATAAAAGCAA 11550

ATTTGTTAGA TCGAGGTGTC CTTTACAGAA TTATGAATCA GGAACCAGGC 11600

GAGTCCTCCT TTTTAGACTG GGCTTCAGAC CCCTATTCAT GTAACTTACC 11650

ACAATCTCAA AATATAACCA CCATGATAAA GAATATAACT GCAAGAAATG 11700

TACTACAGGA CTCACCAAAC CCATTACTAT CTGGATTATT TACAAGTACA 11750



Figure 2E 



ATGATACAAo 
AA.TTCTCCCA 
TTAGGAJVtGC 
G7AGGGATAA 
CAACTATGAT 
TAGTCAGTGA 



SEQ ID NO: 36 



A.T AT C A. T T AA 
A.AATGGACTT 
TA-ACAGGATC 
CCATA7A.CAT 
AAjTA.GGAJVTA 
AGAGATCTGA 
AAGGCTGCTA 
C G AAAT AT C T 
TTACATTAGA 
CTATCACA.TA 
ATCACTTATT 
CTATTAAAGA 
GTAATGTTAA 
GAGTACAGGA 
GTTGTATCAA 
TTAGAGTTAA 
CCCTTTAAAG 
CTTATACAAT 
ATATCAATAT 
TCGGGATAAT 
ACAGTCTGAT 
ACATTTGGAG 
GAAAATAGAA 
AAGACACCTC 
CCAAAAGTGT 
TGGTCCTAAT 
GCGAGTACTC 
CTTGAGATCT 
AC AAGC AT T T 
TAGCATCTTT 
GACGAATTAA 
TAAATATGTG 
TTACGTATGT 
AATCCGCCTG 
CTTAGACAAT 
AGAGAAATAA 
GTCGTAAAAA 
GAATGTTACT 
CACATCAGCT 
CTTGAATTGT 
ACTCTTTTTA 



A v_- G A T G A G G A 
AGGGTTGCGC 
TATAGC7GG7 
ACAGAGGAGG 
CTTGTACAAT 
C .-.-.GAT T AAG 
G AC AJ*J\AAAT 
GAAACTCCA.G 
TGAGCATTGT 
GGATGTATTT 
GCATCATTAA 
AGC AC AAT T G 
TAAGAATAGC 
TGGATGGAAG 
TAGCTTAAAG 
GGTTAAAAGA 
AGAGTAAGCA 
GGCAAATGAA 
CAGGGTTAAG 
CATAACCCTA 
AGAGAGTTAC 
TTAAATACCC 
GATATAGATC 
TGACATGAAT 
GTACTGCAGT 
CTTAAGGAGC 
AACTGAATTT 
GGTTA.CTCGT 
GGAAGGGATC 
ACATTCAGTA 
TTAAGAGATT 
ACTGCTAGTC 
CTTGGATCTA 
ATATCTGTGA 
CTCTCAAGAC 
TGGACCAAAT 
AACAATACTT 
CAAGTATCAG 
GAGGAAAACT 
AAACGATTGA 
ATTGTTAAAA 
AAGTAGTTAT 
TAAGATCCAT 
ACACATGGAA 
GAGGTTATTT 
CACAAATTTT 
GGAGAAGGAG 



ATGACATTTT 
ATGTTGGATA 
ATTAACCTAT 

• - : v^.-.v^.-.'_.-.v_ :. 

TATGAAGATA 
G7GGATGCAT 
ATCCT7TAGA 
AGGATATGT7 
A.CCAGGCAAT 
GGGTCCCT7A 
GGG7A7ATCA 

CATCACAGAT 
A T T T T G AC A C 
TACTGCTACT 
GGTTCA7CAC 
ACTAAAGATA 
TGTATTTGAA 
TGGTCATGCA 
AATGATGAGC 
TGAGAGTAAT 
TA.TCAAAATT 
TACTGGGACG 
TACAATAGCA 
TGGTTGTAAT 
CTGACCCTAG 
GAATCAATTT 
CCATTTGGGA 
CTTAAAGTAT 
TTGGGA.TTGT 
AGGACCAAGT 
TTTATGAGAG 
TAGTGACATG 
ACCTTGCCTT 
TTATTAAATC 
GGATCTGAAC 
GACTGTTAAT 
GCGATTAAGT 
AGATTGGGAT 
CTGTAAATGA 
TTCTGGGGAT 
AACGAGTGAT 
TGACACTTCC 
GGAGTAAACA 
AATGAGGGAA 
CAGGAGCTAT 



■.•-•A.AA-- - <_ 



C AAC A.- A. 1 . 7 C 
AA.CTT ATT AA. 
T AG T AAA.AC T 

C-T7ACTGTCT 
ATTCAA.CTGA 



ACT7GGGCA7 

TGCACAAACA 

CAGTGACAAC 

C AG A.T G AAAT 

AAT ATCTAA.T 

CAAATCTTAT 

TATCTATTTA 

TCTACATATA 

ATATCAATCC 

G AATT T AT A.T 

AAT GGTT AT A 

ACACAGATAT 

GATACAATGT 

TGCAAATGAT 

ATATACTAGT 

GCATATACCC 

TTATATAATG 

TA.TCTAATGC 

GGAGTTTTGA 

TAAGCTTGCT 

AATGGCTGAA 

GAAATAGCAA 

TGTGTGTTGT 

TAACATATCT 

A.TCAAAGAAG 

TAAATCATTC 

A.TCTGAGGAT 

CCCATAGAAG 

CAATTGCAGT 

TAGCTCTAAA 

TCTGAAGTTA 

TCAGGGAGGA 

GTACAAGTTG 

GTTAAAAAAG 

GTTAGCATGT 



G AAA.G A. T AA.G 
T T AAG A.G T AA 
AGACCTAGCC 
G AA.G AA.T G A.T 
GG AGT AAT AA. 

^ \^ l \s \_ 

GATCAGCTC-A. 

GTT ACGGA.TG 

CAAACCAGCT 

7TGGGAATGA 

CGTGCG.AACT 

AT C AAC AAAT 

TTTCTAGTAC 

G A.T AAT A.T GT 

TTATCAACAG 

GGTTAGAGGA 

GAGGATGGAT 

GGAGTCTACA 

ATGATAAGGA 

AGAGATCATT 

TGTACATGCA 

CGCAGCTAGA 

GATGATATTA 

GTTTCTCAAA. 

TTTATGGATT 

AGAACATTAA 

AC T AT C T C A.T 

ATCCTATTTA 

CTCTCAATTT 

TGGAGCATCA. 

ATGATAGAAG 

TTAGCAGAGA 

AGAGAGACTT* 

ATCCTACTCT 

CCCTCAACTG 

TCGTGGCATT 

ATGAGAATA.T 

G A.T AAT C AAA 

GAATTATCAA 

A.TGAAGCTTC 

AGTTA.TCTA.T 

TCTGAAAGCT 

ATAAAGATAG 

TATGATGCTA 



12150
12200
12250
12300

7 ~ = 



12550
12600
12650
12700
12750
12800
12850
12900
12950
13000
13050
13100
13150
13200
13250
13300
13350
13400
13450
13500
13550
13600
13650
13700
13750
13800
13850
13900
13950
14000
14050
14100



Figure 2F 




SEQ ID NO: 36 
CACTCGGTCC TGCAATAAAT
GTAATTGGTC AACGGGAATT 
ACGTAAAAAA CTAGGAAATG 
TATTTAATGG GAATCCCAAT
AGTTTAATAT GGAGTGAATT 
TGACATGGAG GGAGCAATAG 
ATTATAGTAT TATTAGGATT
CTAGTATCAA AAATTATACC
CTATCTATAC AGGTTGTATT 
CATCCAATCC TGCCTCAACA 
TGTACTGTAA TGGAACCCAG 
ATCATCAGTA GAAGAAAATA 
GGAAGAACAA CGAATGGTTA 
TATGGGATAA TGAGGCCATA 
AATTAACTTA AATCACTTAG 
CCAACATTAA TAATATAATT
ATGTTCGAAT GGGTCAATAT 
AGGAAGATAT AATCTATTCC 
TATCACGAAG ATTAGTACTA 
TTACTGACAG GCCGTTTCCC 
CGGATATGTA TCATTGGCTG 
CAAGAAATAT TGTCAAAAGT 
TGGTTTTTAA CCAAAGAGGT 
CAAACTACTA GGAATTCCCA 
TTCAGGGTTA TGAATATGAT 
ACAAAAAATA AAACACCTAA
AAGTAAGAAA AACATATAAT
TTTGGT 



i A ; : Aa_ A A 7 . C - 3GT TT AAA. 
AAA AA T C T T C GCATCAGAAG 